Dynamic programming based segmentation approach to LSF matrix reconstruction
نویسندگان
چکیده
We propose a methodology of speech segmentation in which the LSF feature vector matrix of a segment is reconstructed optimally using a set of parametric/non-parametric functions. We have explored approximations using basis functions or polynomials. We have analyzed the performance of these methods w.r.t. phoneme segmentation (on 100 TIMIT sentences) and reconstruction error based on spectral distortion (SD) measure. We study how amenable these methods are to quantization and their suitability for speech coding. We also estimate the optimum number of segments depending on the reconstruction performance achieved using that many number of segments and the tolerance limit set on the spectral distortion error.
منابع مشابه
Dynamic programming approach to voice transformation
This paper presents a voice transformation algorithm which modifies the speech of a source speaker such that it is perceived as if spoken by a target speaker. A novel method which is based on dynamic programming approach is proposed. The designed system obtains speaker-specific codebooks of line spectral frequencies (LSFs) for both source and target speakers. Those codebooks are used to train a...
متن کاملVoice transformation using principle component analysis based LSF quantization and dynamic programming approach
The goal of voice transformation (VT) is to modify the speech of a source speaker such that it is perceived as if spoken by a target speaker. In this paper, we present a speaker specific line spectral frequency (LSF) quantization based on principle component analysis (PCA) and k-means clustering for VT. An LPC based source-filter model is used to model the speech. Transformation is applied to t...
متن کاملA DYNAMIC PROGRAMMING APPROACH TO CONTEXT−FREE VOICE TRANSFORMATION (MonAmOR3)
In this paper, we present a dynamic programming approach to voice transformation (VT). The goal of VT is to modify the speech of a source speaker such that it is perceived as if spoken by a target speaker. The speech model used in this work is based on MELP (Mixed Excitation Linear Prediction) speech coding algorithm. The designed system obtains speaker−specific codebooks of line spectral frequ...
متن کاملA DSS-Based Dynamic Programming for Finding Optimal Markets Using Neural Networks and Pricing
One of the substantial challenges in marketing efforts is determining optimal markets, specifically in market segmentation. The problem is more controversial in electronic commerce and electronic marketing. Consumer behaviour is influenced by different factors and thus varies in different time periods. These dynamic impacts lead to the uncertain behaviour of consumers and therefore harden the t...
متن کاملAn interval-valued programming approach to matrix games with payoffs of triangular intuitionistic fuzzy numbers
The purpose of this paper is to develop a methodology for solving a new type of matrix games in which payoffs are expressed with triangular intuitionistic fuzzy numbers (TIFNs). In this methodology, the concept of solutions for matrix games with payoffs of TIFNs is introduced. A pair of auxiliary intuitionistic fuzzy programming models for players are established to determine optimal strategies...
متن کامل